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Abstract 

The setting of metric spaces is very natural for numerous questions 
concerning manifolds, norms, and fractal sets, and a few of the main 
ingredients are surveyed here. 
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1 Metric spaces 

A metric space is a set M equipped with a function d{x, y) defined for a;, y G M 
such that d{x, y) is a nonnegative real number that is equal to exactly when 

x = y, 

(1.1) d{y,x)=d{x,y) 
for every x,y G M, and 

(1.2) d{x,z) <d{x,y)+d{y,z) 

for every x,y,z € M, which is known as the triangle inequality. 

Remember that the absolute value of a real number r is denoted |r| and 
equal to r when r > and to — r when r < 0. It is easy to check that 

(1.3) |r + t| < |r| + |i| 
and 

(1.4) \rt\ = \r\\t\ 

for any pair of real numbers r, t. The standard metric on the real line R is 
given by |r — i|, which is the first main example of a metric space. 

If {M,d{x,y)) is a metric space, then d{x,y) is called the distance function 
or metric on M. For each x & M and r > 0, the open ball in M with center x 
and radius r is 

(1.5) B{x,r)^{yeM ■.d{x,y)<r}. 
Similarly, the closed ball with center x and radius r > is 

(1.6) 'B{x, r) = {y€M: d{x, y) < r}. 
Thus 

1 .7 B{x, r) C B{x, r) C B{x, t) 

when r <t. 



2 



Let a, b be real numbers with a <b. The open interval {a, b) in R is defined 

by 

(1.8) (0,6) = {r e R: a < r < 6}, 
and the closed interval [a, b] is defined by 

(1.9) [a,6] = {r G R : a < r < 6}. 

One may also allow a = 6 for the latter. The length of these intervals is 6 — a. 
Note that open and closed balls in the real line with respect to the standard 
metric are open and closed intervals. 

2 A little calculus 

Suppose that a, b are real numbers with a < 6, and that f{x) is a continuous 
real-valued function on the closed interval [a, b] in the real line. The extreme 
value theorem states that there are elements p, q of [a, 6] at which / attains its 
maximum and minimum, which is to say that 

(2.1) f{q) < fix) < m 

for every x G [a,b]. This works as well for continuous real- valued functions on 
compact subsets of metric spaces, or even topological spaces. If p or g is in the 
open interval (a, 6) and / is differentiable there, then the derivative f'{p) or 
f'{q) is equal to 0. 

Suppose that f{x) is differentiable at every point in (a, b). If /(a) = /(&) = 0, 
then Rolle's theorem states that f'{x) = for some x G (a, 6). This is because 
the maximum or minimum of / on [a, 6] is attained on (a, 6), or f{x) = for 
every x G [a,b]. No matter the values of /(a), f{b), the mean value theorem 
says that there is an x G (a, b) such that 

(2.2) fix) = ffl^^. 

This follows from Rolle's theorem applied to / — 0, where <l){x) = ax + (3 and 
a,/3 G R are chosen so that 0(a) = /(a), (p{h) = f(b). 

Of course, the derivative of a constant function is 0, and the mean value 
theorem implies that a continuous function / on [a, b] is constant if the derivative 
of / exists and is equal to at every point in (a, 6). If / is monotone increasing 
on [a,b], in the sense that f{x) < f{y) when a < x < y < b, then f'{x) > 
for every x G (a, b) at which / is differentiable. Conversely, if / is continuous 
on [a,b], differentiable on (a, 6), and f'{x) > for each x G (a, 6), then / is 
monotone increasing on [a, b], by the mean value theorem. If f'{x) > for every 
x G (a, b), then / is strictly increasing on [a, b], in the sense that f{w) < f{y) 
when a < w < y < b. However, the derivative of a strictly increasing function 
may be equal to 0, as when f{x) = x^. 
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3 Norms on R" 

Let n be a positive integer, and let R" be the space of n-tuples of real numbers. 
This means that an clement x of R" is of the form x = (xi. . . . . .t„), where the 

coordinates xi, . . . ,Xn oix are real numbers. Addition and scalar multiplication 
on R" arc defined coordinatewise in the usual way, so that R" becomes a finite- 
dimensional vector space over the real numbers. 

A norm on R" is a function N{x) such that N{x) is a nonnegative real 
number for every x € R" which is equal to exactly when a; = 0, 

(3.1) N{rx) = \r\N{x) 
for every r G R and x G R", and 

(3.2) N{x + y)< N{x) + N{y) 
for every x,y € R". If A?' is a norm on R", then 

(3.3) dN{x,y)=N{x-y) 
is a metric on R". 

For example, the absolute value function is a norm on R, for which the 
corresponding metric is the standard metric on the real line. The standard 
Euclidean norm on R" is defined by 



n 

(3.4) N = (E-.') 
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and the corresponding metric is the standard Euclidean metric on R". It is not 
so obvious that this satisfies the triangle inequality, and hence is a norm, and 
we shall discuss a proof of this fact in Section 5. 
One can check directly that 

n 

(3.5) 11:^^111 = El^^l 
and 

(3.6) ||a;||oo = max(|a;i|, . . . , |a;„|) 
are norms on R". We shall see in Section 5 that 

n ^1 

(3.7) II^L=(El^^l') ' 

is a norm when p > 1, which includes the Euclidean norm as a special case. 
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4 Convex functions 



A real- valued function f{x) on the real line is said to be convex if 
(4.1) fit x + {l-t)y)<tf{x) + {l-t) f{y) 

for every x, y £ R and t G [0, 1]. This is equivalent to 

(4 2) /(^) - m < fjy) - fM 

w — X ~ y — w 

for every a;,w,y e R such that x < w < y. Applying this condition twice, we 
get that 

(4 3) fM - fix) ^ fiz) - fiy) 

w — X ~ z — y 

when X < w < y < z. As another refinement of (4.2), one can use (4.1) to show 
that 

(4.4) /H - /(^) < m - m < fjy) - /H 

w—x ~ y—x ~ y—w 

when X < w < y. 

If / is differentiable and /' is monotone increasing, then the mean value 
theorem implies (4.2) and hence that / is convex. Conversely, (4.3) implies that 
the derivative of / is monotone increasing when / is differentiable. Actually, one 
can show that the right and left derivatives f^ix), f'_ix) exist for each a; G R 
when / is convex, and satisfy 

(4.5) /i(x)</;(x) 
and 

(4.6) r+ix) < f'_iy) 

when X < y. One can also show that these conditions characterize convexity, 
using analogues of RoUe's theorem and the mean value theorem for functions 
with one-sided derivatives. 

A function / : R — > R is strictly convex if 

(4.7) fitx + i\-t)y)<t fix) + (1 - t) /(y) 

when X y and < i < 1. This corresponds to strict inequality in (4.2), (4.3), 
and (4.4) as well. If / is differentiable on R, then / is strictly convex if and 

only if /' is strictly increasing. Otherwise, strict convexity can be characterized 
in terms of one-sided derivatives by the requirement that 

(4.8) f'+ix) < f'_iy) 

when X < y. Alternatively, if a convex function / on R is not strictly convex, 
then / is equal to an affine function on an interval of positive length. 

For example, consider /(r) = \r\^, p > 0. If p = 1, then /(r) = |r| is convex 
but not strictly convex on R. If p = 2, then /(r) = is twice-differentiable. 
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f"{r) ~ 2, and / is strictly convex. If p > 2, then / is twice-differentiablc on R, 
/"(r) > when r 0, /"(O) = 0, and / is strictly convex because /' is strictly 
increasing. If 1 < p < 2, then / is different iable on R, twice-differentiable on 
R\{0}, f"{r) > when r 7^ 0, and again / is strictly convex since /' is strictly 
increasing. If < p < 1, then / is twice-differentiable on R\{0}, /"(r) < 
when r 7^ 0, and / is not convex. 



5 Convex sets 

A set C R" is said to be convex if 

(5.1) tx + {l-t)y€E 

for every x,y & E and t G (0, 1). For example, open and closed balls associated 
to metrics defined by norms on R" are convex. 

Conversely, suppose that N{x) is a nonnegative real- valued function on R" 
such that iV(a;) > when x and the homogeneity condition (3.1) holds for 
all x G R" and r e R. If the closed unit ball 

(5.2) Bn = {x€R" : N{x) < 1} 

is convex, then N satisfies the triangle inequality (3.2) and hence is a norm. Let 
x,y & R" be given, and let us check (3.2). We may suppose that x,y ^ 0, since 
the inequality is trivial when a; = or y = 0. Put 

so that N{x') = N{y') = 1. By hypothesis, 

(5.4) N{tx' + {l-t)y') <1 

when < t < 1. Applying this with 

N{x) 



(5.5) 



N{x) + N{y)' 



we get (3.2), as desired. 

For example, suppose that A'' (a;) = ||a;||p, 1 < p < 00. Let x,y € R" with 
< 1 be given, so that 

n n 

(5.6) i^k.r ^i%r<i. 

We would like to show that 

(5.7) \\ta:+{l-t)y\\p<l 
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when < i < 1, which is the same as 

n 

(5.8) + <1. 

The convexity of |r|^ on R implies that 

(5.9) \tx, + (1 - t) %r < tl^j]" + (1 - i) lyjl" 

for each j, and the desired inequaUty follows by summing this over j. 

A norm TV on R" is said to be strictly convex if the unit ball is strictly 
convex in the sense that 

(5.10) N{tx + {l-t)y) <1 

when x,y € R", N{x) = N{y) = I, x y, and < t < 1. It is easy to see 
that the absolute value function is strictly convex as a norm on R, if not as a 
general function as in the previous section. One can also check that ||a;||p is a 
strictly convex norm on R" when p > 1, using the strict convexity of \r\P on R 
and computations as in the preceding paragraph. However, ||a;||i and || 

^ 1 1 oo ^re 

not strictly convex norms on R" when n > 2. 



6 A little more calculus 

Let / be a continuous real- valued function on a closed interval [a, b], a <b. The 
integral 

(6.1) / f{t)dt 

J a 

can be defined in the usual way as a limit of finite sums. The convergence of 
the finite sums to the integral uses the fact that continuous functions on [a, h] 
are actually uniformly continuous. It is well known that continuous functions 
on compact subsets of any metric space are uniformly continuous. 
Consider the indefinite integral 

(6.2) F{x)= r f{t)dt. 

J a 

This defines a continuous function on [a, b] which is differentiable on (a, b) and 
satisfies F'{x) = f{x). Similarly, F has one-sided derivatives at the endpoints 
a, b that satisfy the same condition. If another differentiable function on (a, b) 
has derivative /, then the difference of F and this function is constant, by the 
mean value theorem. 

Clearly F is monotone increasing on [a, 6] if /> on the whole interval. 
If / > on [a, 6], then F is strictly increasing. The same conclusion holds if 
/ > on [a,b] and / > at some point in any nontrivial subinterval of [a, 5]. 
Equivalently, if / > on [a, 6], and if F is not strictly increasing on [a, b], then 
/ = at every point in a nontrivial subinterval. 



7 



Suppose that /' > on {a,b), or simply that / is monotone increasing on 
[a, b] . This imphes that 

(6.3) «,)<£M:i|W</(„) 

when a < X < y < b. In particular, 

F{w) - F{x) ^ F{y) - F{w) 

w — X ~ y — w 

when a<x<w<y<b. If / is strictly increasing, then these inequalities are 
strict as well. 



7 Supremum and infimum 

A real number b is said to be an upper bound for a set A C R if a < 6 for every 
a G A. Wc say that 6i G R is the least upper hound or supremum of A if bi 
is an upper bound for A and bi < b for every upper bound 6 of ^. If 62 € R 
also satisfies these two conditions, then bi < 62 and 62 < and hence 61 =62- 
Thus the supremum of A is unique when it exists, in which case it is denoted 
sup A. The completeness property of the real line states that every nonempty 
set with an upper bound has a least upper bound. 

More precisely, this is completeness with respect to the ordering on the real 
line, which can be defined for other ordered sets. There is also completeness for 
metric spaces, which means that every Cauchy sequence converges. Both forms 
of completeness hold on the real line, and are basically equivalent to each other 
in this particular situation. However, the two notions are distinct, because they 
can be applied in different circumstances. There are completeness conditions 
concerning the existence of solutions of ordinary differential equations as well, 
which may be related to completeness for an associated metric space. 

Similarly, a real number c is said to be a lower bound for ^ C R if c < o for 
every a E A, and ci G R is the greatest lower bound or infimum of A if ci is a 
lower bound for A and c < ci for every lower bound c of A. This is unique when 
it exists for the same reasons as before, and is denoted inf A. It follows from 
completeness that a nonempty set A C R with a lower bound has a greatest 
lower bound, which can be characterized as the supremum of the set of lower 
bounds of A. Alternatively, the infimum of A is equal to the negative of the 
supremum of —A = {—a : a G A}. 



8 Bounded sets 

Let (M, d{x, y)) be a metric space. A set E C M is said to be bounded if there 
is a p G M and an r > such that 

(8.1) d{p,x)<r 
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for every x £ E. This implies that for every q G M there is a t > such that 
d{q, x) <t for every x G E,hy taking t = r + d{p, q). 

Equivalently, E C M is bounded if the set of distances d{x, y) foi x,y G E 
has an upper bound in R. If E is nonempty and bounded, then the diameter 
of E is defined by 

(8.2) diamE = sup{d{x,y) : x,y e E}. 

The diameter of the empty set may be interpreted as 0. 

U El C E2 ^ M and E2 is bounded, then Ei is bounded, and 

(8.3) diamiJi < diam£'2. 

The union of two bounded subsets of M is also bounded, but the diameter of 
the union may be much larger than the sum of the diameters of the two subsets. 

Suppose that M is R" equipped with a norm N and its associated metric 
rfjv(a;, y). The convex hull E oiasct E C R" consists of all convex combinations 
of elements of E. More precisely, E is the set of all finite sums of the form 

k 

(8.4) E^^^W' 

i=l 

where fc is a positive integer, n,. . . ,rk are nonnegative real numbers such that 

k 

(8.5) ^rk = l, 

i=l 

and a;(l), . . . , x{k) are elements of E. It is well known that one can take k — n+1 
here, but wc shall not need this fact. By construction, i? is a convex set in R" 
that contains E. Moreover. E is the smallest such set, in the sense that E is 
contained in any convex set in R" that contains. If E is bounded, so that E 
is contained in a ball, then E is contained in the same ball, and hence E is 
bounded. Let us check that 

(8.6) diam^rii^ < diamAri?, 

where the subscript N indicates that the diameter uses the norm N. Let 

fe I 
(8-7) ^ = Y.rix{i), rj = J2tjy{j) 

be arbitrary elements of E, as before. Thus 

fc ; 

(8-8) = 

and therefore 

fc ( 

(8.9) N{^-7i)<J2J2 ^(^(^) - 

i=i j=i 
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by the properties of norms. This imphes that 

(8.10) -V)< max{7V(x(i) - y{j)) : 1 < i < k, 1 < j < I}, 

and consequently — rf) < dianijv-B, as desired. 

9 Lipschitz mappings 

Let (Ml, di(a;, y)) and [M^, d2{u, v)) be metric spaces. A mapping / : Mi — »• M2 
is said to be Lipschitz if 

(9.1) d2{f{x),f{y))<Cdi{x,y) 

for some C > and all x,y ^ M . More precisely, this means that / is Lipsehitz 
of order 1, and we shall discuss other Lipschitz conditions later. One can also 
say that / is C-Lipschitz or C-Lipschitz of order 1 to mention the constant C 
explicitly. 

Thus / is C-Lipschitz with C = if and only if / is constant. Note that 
Lipschitz mappings are uniformly continuous. Suppose that (M3,d3(w, z)) is 
aiiotlier nic;tric space, and that /i : A/i —t M2 and /2 ; M2 M3 are Lipschitz 
mappings with constants Ci, C2, respectively. The composition /2 o fi is the 
mapping from Mi to M2 defined by 

(9.2) (/2 0/i)(x) = /2(/l(x)), 

and it is easy to check that this is Lipschitz with constant equal to the product 
of Ci and C2- 

If / : Ml — > M2 is C-Lipschitz and C Mi is bounded, then 

(9.3) f{E) = {f{x) :xeE} 
is bounded in M2, and 

(9.4) diam2/(£) < Cdiami^;. 

Here the subscripts indicate in which metric space the diameter is taken. This 

is easy to verify, directly from the definitions, and suggests another way to look 
at the composition of Lipschitz mappings, as in the previous paragraph. 

10 Real- valued functions 

Let / be a real- valued function on an open interval (a, b) in the real line. If / is 
C-Lipschitz with respect to the standard metric on the domain and range, then 

(10.1) \nx)\ < c 

at every point x G (a, b) at which / is differentiable, by definition of the deriva- 
tive. Conversely, if / is differentiable and satisfies this condition everywhere on 
(a, 6), then / is C-Lipschitz, by the mean value theorem. 
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Now let {M, d{x, y)) be a metric space. A function / : M — > R is C-Lipschitz 
with respect to the standard metric on R if and only if 

(10.2) f{x)<f{y) + Cd{x,y) 

for every x, y G M. This follows easily from the definitions. In particular, 
fp{x) = d{p, x) is 1-Lipschitz for every p e M. 
If A C M, A 7^ 0, and x e M, then put 

(10.3) dist(.T, A) = inf{d(a;, a) : a € A}. 
For each x,y £ M and a £ A, 

(10.4) dist(a;, A) < d{x, a) < d{x, y) + d{y, a), 
and therefore 

(10.5) dist(.x, A) < dist(y, A) + d{x, y). 

This shows that dist(x,^) is 1-Lipschitz on M. 

Suppose that /i, /2 : M — > R are Lipschitz with constants Ci, C2, respec- 
tively. For any ri , r2 £ R, /i -I- r2 /2 is Lipschitz with constant | ri | Ci -|- 1 r2 1 C2 • 
Suppose also that /i, /2 are bounded on M, with 

(10.6) |/i(a;)| < fci, \h{x)\ < 
for some fci , > and every x € M. Because 

(10.7) h{x) h{x) - h{y) h{y) 

= (hix) - h{y)) f2{x) + My) {f2{x) - f2{y)) 

for every x,y £ M, fi /2 is Lipschitz on M with constant k2 Ci + C2. 



11 R*^- valued functions 

Let N he a, norm on R". Thus A'' is 1-Lipschitz as a real- valued function on R" 
with the metric dNix,y) associated to N, as in the previous section. One can 
also show that A'' is bounded by a constant multiple of the standard Euclidean 
norm on R". This uses the finite-dimensionality of R" in an essential way, and 
it implies that A^ is Lipschitz with respect to the standard metric on R". 

Suppose that / is a continuous R"-valued function on a closed interval [a, b] 
in the real line. As an extension of the triangle inequality for N, 

(11.1) n( f f{t)dt) < f N{f{t))dt. 

Indeed, the analogous statement for the finite sums follows from the triangle 
inequality for A^. The integral of / can be approximated by finite sums, and 
continuity of A" as in the preceding paragraph can be employed to pass to the 
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limit. Alternatively, one can use duality, as follows. For any linear functional 
^ : R" ^ R, 

(11.2) ct>(^jj{t)dt)= j^4^{f{t))dt. 

If |0(t«)| < N{w) for every w € R", then we get that 

< f N{f{t))dt. 

J a 

A famous theorem states that for each v € R" there is such a (f) with (t){v) = 
N{v), which permits one to estimate the norm of the integral. Wc shall not 
discuss the proof of this hero, but one can take (^{w) to be the standard inner 
product of w with v/\v\ when v ^ and N is the Euclidean norm on R", and 
there are also explicit expressions for (p when N{w) = \\w\\p, 1 < p < oo. 

Suppose now that F : [a, b] R" is C-Lipschitz with respect to the standard 
metric on R and the metric on R". If F is differentiable at a point x S (a, b), 
then N{F'{x)) < C. This follows from the definition of the derivative, as in 
the real-valued case. Conversely, if F is continuously difi^erentiable on [a, 6] 
and N{F') < C, then one can use the fundamental theorem of calculus and 
the integral form of the triangle inequality to show that that F is C-Lipschitz 
with respect to A''. One can use duality to get the same conclusion when F is 
continuous on [a,b] and diS'erentiable on (a, 5) with N{F') < C, by applying 
the mean value theorem to (f> o F for linear functionals (p ■ R" R- 

Let (M, d{x, y)) be a metric space, and let F = (Fi, . . . , F„) be a mapping 
from M into R". If R" is equipped with the norm Halloo, then it is easy to see 
that F is C-Lipschitz if and only if Fi , . . . , f „ are C-Lipschitz as real- valued 
functions on M. Of course, one can estimate Lipschitz conditions for F in terms 
of Lipschitz conditions for Fi , . . . , F,, for other norms on R" , and vice- versa, 
but the relationship between the constants is normally not quite as simple as 
for the norm HwHoo. 



(11.3) 



/ 



ct>{f{t))dt 



12 Bounded variation 

Let / be a real- valued function on a closed interval [o, b]. A partition of [a, b] is 
a finite sequence {tj}"^o ^^^^ numbers such that 

(12.1) a = to<ti<-- - <tn = h. 
For each partition V = {tj}^^Q of [a,b], consider 

n 

(12.2) yp(/) = ^|/(i,)-/(t,_i)|. 

This measures the variation of / on the partition V. We say that / has bounded 
variation on [a, b] if there is an upper bound for V-p{f) over all partitions V of 
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[a,b]. In this case, the total variation V^{f) of / on [a,b] is defined by 

(12.3) V^{f) = sup{Vp -.Visa partition of [a, b]}. 

Thus V^{f) = if and only if / is constant on [a, b]. 

Using the partition that consists of only a, b, we get that 

(12.4) |/(6)-/(a)|<yj'(/). 
If / is monotone increasing on [a, b] , then 

(12.5) Vr{f)=f{b)-f{a) 

for every partition V of [a, b]. Hence / has bounded variation on [a, b], and 

(12.6) V^{f)=m-f{a). 
Conversely, if / has bounded variation on [a, b] and 

(12.7) V^{f) = \m-f{a)\, 

then / is either monotone increasing or decreasing on [a,b]. 
If / is C-Lipschitz on [a,b], then 

(12.8) Vr{f)<C{b-a) 

for every partition V of [a, b]. Hence / has bounded variation on [a, b], and 

(12.9) V^{f)<C{b-a). 
If (f) : H —> H is C-Lipschitz, then 

(12.10) Vp{cf>of)<CVr{f) 

for every / : [a,b] — > R and partition V of [a,b]. If f has bounded variation on 
[a, b] , then it follows that ^ o / has bounded variation on [a, b] , and 

(12.11) V^i^o f)<CV^{f). 

Let /i, /2 : [a, b] ^ R and ri, r2 G R be given. For any partition V of [a, b], 

(12.12) Vvinfi+r2f2) < \ri\Vvi.fi) + \r2\Vvif2). 

If /i , /2 have bounded variation on [a, b] , then it follows that ri /i + r2 /2 also 
has bounded variation, with 

(12.13) V^in h + r2 ,/2) < \ri\V!:{h) + \r2\ V',{h)- 
Suppose that /i, /2 arc bounded on [a, 6], so that 

(12.14) |/i(x)| < fcl, \f2{x)\ < k2 
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for some fci, ^2 > and every x £ [a, b]. It is easy to check that 

(12.15) Vpifi /a) < k2 Vp{fi) + fci Vp{f2) 

for every partition V of [a,b]. If /i, /2 have bounded variation on [a,b], then 
/i /2 has bounded variation, and 

(12.16) V^ih h) < k2 V^ih) + h V^ih). 

This is analogous to the earlier estimate for the Lipschitz constant of the product 
of bounded Lipschitz functions, and to the Leibniz rule for differentiating the 
product of two functions. 

Suppose that ai, 6i are real numbers such that a < ai < bi < b. If / has 
bounded variation on [a, b], then / has bounded variation on [ai, and 

(12.17) K'U/)<K'(/)- 

This is because every partition of [ai, &i] can be extended to a partition of [a, b]. 
In particular, / is bounded on [a, b] when it has bounded variation. 

A partition V of [a, b\ is said to be a refinement of a partition V of [a, b] if 
V contains all of the terms in V. In this case, one can check that 

(12.18) Vv{J)<Vv{f) 

for every / : [a, b] R, using the triangle inequality. Also, any finite collection 
of partitions of [a, b] has a common refinement. 

Suppose that / has bounded variation on [a, 6], and that x S (a, 6). Thus 
the restrictions of / to [a, x] and to [x, b] have bounded variation, and moreover 

(12.19) V:{f) + V^{f) = V^{f). 

Indeed, any partitions T'l, V2 of [a,x], [x,b], respectively, can be combined to 
get a partition V3 of [a, b] for which 

(12.20) VvAf) + VvM) = VvAfl 

which implies that V^{f) + V^{f) < V^{f). To get the opposite inequality, note 
that every partition of [a, b] can be refined if necessary to contain x, and hence 
to be a combination of partitions of [a, x] and [x, b]. The same argument shows 
that / has bounded variation on [o, b] if it has bounded variation on [a, x] and 
on [x,b]. 

Suppose that / is continuously differentiable on [a, 6]. U a < r < t < b, then 

(12.21) |/(i)-/(r)| 



/ /'(Orf? < / l/'(0|rf6 

J r J r 



This implies that 

(12.22) Vv{f)< I 

J a 
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for every partition V of [a, h] . One can show that 



(12.23) V',{f)= I 

J a 



using very fine partitions V of [a,b]. 

For each r £ R, put r+ = r when r > and r+ = when r < 0, and 
r_ = — r when r < and r — = when r > 0, so that 

(12.24) r+ — r_ = r, r++r_ = |r|. 
Given / : [a, 6] ^ R and a partition V = {tjYj^Q of [a,b], put 

n 

(12.25) Pp(/) = ^(/fe)-/(i,_i))+ 
and 

n 

(12.26) 7Vp(/)=^(/(t,)-/(t,_i))_. 
Thus 

(12.27) Pp(/)+7Vp(/) = yp(/) 
and 

(12.28) Pr{f)-Nrif)^f(b)-f{a). 
Suppose that / has bounded variation on [a,b], and put 

(12.29) P^if) = sup{P-pif) : P is a partition of [a, b]} 
and 

(12.30) N^{f) = sup{N-p{f) -.Visa partition of [a, b]}. 
One can check that 

(12.31) P'aif) + Kif) = V^{f) 
and 

(12.32) Pa{f)-Kif) = fib)-fia). 
Similarly, 

(12.33) P^{f)-NS{f) = f{x)-f{a) 

when a < X < b. This implies that / can be expressed as the difference of 
two monotone increasing functions on [a, 6], since P^if), ^aif) monotone 
increasing in x. 

Functions of bounded variation do not have to be continuous, but they can 
only have jump discontinuities. More precisely, if / has bounded variation 
on [a,b], then / has one-sided limits from both sides at every point in (a, 6), 
and from the right and left sides at a, b, respectively. This follows from the 
analogous statement for monotone functions and the fact that a function of 
bounded variation can be expressed in terms of monotone functions, and it can 
also be shown more directly. 
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13 Lengths of paths 

Let (Af, d{x, y)) be a metric space, let a, b be real numbers with a < b, and let 
/ be a function on [a,b] with values in M. For each partition P = {tjjj^o of 

M, consider 

n 

(13.1) Ar{f) = J2d{fitj)J{tj-i))- 

This is the same as the variation Vp{f) oi f onV when M is the real line with 
the standard metric. If there is an upper boimd for A-p over all partitions V of 
[a, b], then we say that the path / : [a, b] M has finite length, and the length 
of the path is defined by 

(13.2) A^(/) = sup{Ap : -p is a partition of [a, b]}. 

This is the same as the total variation V^{f) of / when M = R. As in the 
previous case, A^(/) = if and only if / is constant. If a < r < t < b, then 

(13.3) d(/(r),/(f)) < Ap(/) 

for any partition V of [a,b] that contains r, t. Hence f{[a,b]) is a bounded set 
in M when / : [a, 6] — *■ M has finite length, with 

(13.4) diam/([a,5])<A^(/). 

Of course, f{[a,b]) is a compact set in M when / is continuous, and therefore 
bounded. If / is continuous, then /([a, 6]) is also a connected set in M. 
If f : [a,b] ^ M is C-Lipschitz, then / has finite length, and 

(13.5) Alif)<C{b-a). 

Let {M,d{u,v)) be another metric space, and suppose that : M — * M is 
C-Lipschitz. For any f : [a,b] ^ M and partition V of [a, b], 

(13.6) Ap(0o/)<CAp(/), 

where A is the analogous quantity for M. If / : [a, 6] — > M has finite length, 
then (p o f : [a,b] ^ M does too, and 

(13.7) A^(.^o/)<CA^(/). 

In particular, if / has finite length and : M ^ R is Lipschitz, then f has 
bounded variation. 

If f : [a,b] ^ M has finite length and a < ai < bi < b, then the restriction 
of / to [ai,6i] has finite length, and 

(13.8) A^^(/)<A^(/). 
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If V, V are partitions of [a, b] and V is a refinement of V, then 

(13.9) Ap(/)<Ap,(/) 

for any / : [a, 6] ^ M, as in the case of real- valued functions in the previous 
section. As before, one can use this to show that 

(13.10) A^(/) + A^(/) = A^(/) 

for every x G (o, 6) when / has finite length. If a < a; < 6, then 

(13.11) lira m.f) 

J/-»X + 

exists, because A^(/) is monotone increasing in x, and hence 

(13.12) lim sup{A^(/) : X < w < y} = 0. 

y—>x+ 

This implies that 

(13.13) lim diam/((x,y]) = 0, 

y—>x+ 

since 

(13.14) diam/((x, 2/]) = sup{diam/([u', y]) : x < w < y} 

< sup{Aj;,(/) :x<w<y}. 

If M is complete, then it follows that / has a limit from the right at x, and 
similarly there is a limit from the left when a < x < b. 

Suppose now that M is R", equipped with a norm N, and thus the metric 
dN{x,y) associated to N too. If /i,/2 : [a,b] — > R" have finite length and 
ri, r2 e R, then n /i + r2 /2 has finite length, and 

(13.15) A^(ri/i+r2/2) < |ri| A^(/i) + hi A^(/2). 

This is similar to the case of rcal-valucd functions, and one can also treat the 
product of a real- valued function and an R"-valued function on [a, b] in the 
same way as before. If / : [a, b] — > R" is continuously diff'erentiable, then one 
can show that / has finite length and that 

(13.16) At(/)= fN{f{0)d^, 

J a 

in practically the same way as before. It can be interesting to consider integral 
norms ^ 

(13.17) Nif'iord^y^" 

as well, 1 < p < 00. The p = oo case corresponds to the maximum of N{f') 
on [a,b]. This integral norm is especially interesting when p = 2 and N is 
the standard Euclidean norm on R". For other p, there is some simplification 
when N{v) = \\v\\p. If N{v) = \\v\\i, then the length of any path of finite 
length in R" is equal to the sum of the total variations of the coordinates of 
the path. This uses the fact that any finite collection of partitions of [a, b] has a 
common refinement, so that independent partitions for the coordinate functions 
are equivalent to using the same partition for the whole path. 
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14 Snowflake metrics 



Let a be a positive real number, with a < 1. For any pair of nonnegative real 

mimbcrs u, v, 

(14.1) {u + v)°' <u" +v°'. 
To sec this, observe that 

(14.2) max(u, v) < (m" + u")^/", 
and hence 

(14.3) u + v< niax(M, v)^-" (m" + -y") < (u" + w")i/«. 

Note that the inequality is strict in (14.1) when u,v > 0. 

If {M,d{x,y)) is a metric space, then it follows from (14.1) that d{x,y)°' is 
also a metric on M . This docs not change the topology of M, but it docs change 
the geometry. Many familiar examples of snowflake curves in the plane have 
approximately this type of geometry, for instance. 

Suppose that / : [a, 6] ^ M is a continuous path with finite length with 
respect to d{x,y)°'. This means that 

n 

(14.4) ^rf(/(i,),/(i,_i))«<A 

j=i 

for some A> and every partition {tj}^^Q of [a,b]. Let e > be given. By 
continuity and compactness, / is uniformly continuous, and so there is a (5 > 
such that 

(14.5) d{f{r)J{w))<e 
for every r, •u; G [a, b] such that |r — i^j < 6. Hence 

n 

(14.6) ^d{f{tj),f{tj.,))<e'--A 

when tj—tj-i < d for each j = 1, . . . , n. Every partition of [a, b] has a refinement 
with this property, which implies that the length A^(/) of / with respect to 
d{x.y) satisfies 

(14.7) A^(/)<ei-"A 

Thus ^aif ) — '-'i since e > is arbitrary, and / must be constant. 

15 Holder continuity 

Let {Mi,di{x, y)) and (M2, d^iw, z)) be metric spaces. A mapping / : Mi — » M2 
is said to be Holder continuous of order a, < a < 1, if 

(15.1) d2{f{x),f{y))<Cdi{x,yr 
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for some C > and every x,y G Mi. One might also say that / is Lipschitz 
of order a in this case, but it will be convenient to refer to this as a Lipschitz 
condition when a = 1 and Holder continuity when < a < 1. Similar names 
are sometimes used for other related conditions as well. 

As in the previous section, di{x,y)°' is a metric on Mi, and therefore / is 
Holder continuous of order a with respect to d\ {x, y) if and only if / is Lipschitz 
with respect to di{x,y)°'. Thus many basic properties of Holder continuous 
mappings follow from the corresponding statements for Lipschitz mappings. In 
particular, 



is a real- valued Holder continuous function of order a on Mi with C = 1 for 

each p e Ml . 

Let (M, d{x,y)) be a metric space, and consider the case of a continuous 
path f : [a.,h] ^ M. If / is C-Lipschitz, then / is Holder continuous of order a 
for each a G (0, 1) with constant C (6 — a)^^". Of course, / also has finite length 
< C {b — a) when / is C-Lipschitz. However, continuous paths of finite length 
need not be Holder continuous of any positive order, and there are couterex- 
amples already for monotone increasing real- valued functions. Similarly, Holder 
continuous paths may not have finite length. 

Suppose that / : [a, 5] M is Holder continuous of order a with constant 
C > 0. For each p > 0, f{[a,b]) is contained in the union of 0(p-i/") subsets 
of M with diameter < p, because [a,b] is the union of 0{p~^^") subintervals 
of length < {p/Cy^". This implies that the Minkowski dimension of f{[a,b]) 
is < 1/a, and hence the Hausdorff dimension of f{[a,b]) is < 1/a too. This is 
an analogue for Holder continuous paths of the fact that Lipschitz paths have 
finite length. 

16 Coverings 

If [a, b], [ai, 6i], . . . , [a„, 6„] are closed intervals in the real line such that 



(15.2) 



fp{x) = di{p,x) 



a 



n 



(16.1) 




then 



n 




More generally, if Ei, . . . ,En are bounded subsets of R such that 



n 



(16.3) 




then 



n 



(16.4) 
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Indeed, each Ej is contained in a closed interval of the same diameter. 

Let (M, d{x, y)) be a metric space, and lei A, Ex, ... , En be bounded subsets 
of M such that 

n 

(16.5) A<^[jEi. 
If A is connected, then 

n 

(16.6) diam A < diam Ej . 

To see this, remember first that continuous mappings send connected sets to 
connected sets. If / : M ^ R is continuous, then f{A) is an interval in the real 
line, which may be open, or closed, or half-open and half-closed. At any rate, 

n 

(16.7) diamR/(^) < ^ di3m^f{Ej), 

where the subscripts indicate that these are diameters in R. If / is C-Lipschitz, 
then 

n 

(16.8) diamR/(^) < C ^diami;^. 

The desired estimate follows by applying this to 1-Lipschitz functions of the 
form fp{x) ~ d{p,x), p € A. 

The hypothesis that A be connected is essential here. If A is a finite set with 
at least two elements, then diam A > 0, but A is contained in the union of finitely 
many sets with one element and thus diameter 0. Cantor's middle-thirds set in 
the real line has diameter equal to 1 , and is contained in the union of 2^ intervals 
of length for each £ > 1. A compact set A C H has Lebesgue measure 
exactly if for each e > there are finitely many bounded sets Ei, . . . , En C R 
such that A C Ej and J2'j=i diam Ej < e. 

In particular, if A C M is a bounded connected set and p > 0, then A is 
not covered by fewer than dinmA/p bounded subsets of M of diameter < p. 
Depending on the situation, many more of these subsets may be required. If M 
is R" equipped with the standard metric, for example, then a bounded set A 
can be covered by 0{p^"') sets of diameter < p. One needs at least a positive 
multiple of p^" such sets when A has nonempty interior, because otherwise the 
n-dimensional volume of A would be too small. 

17 Domains in 

A set [/ in a metric space M is an open set if for every p gU there is an r > 
such that B{p, r) C U. Any norm N on R" determines the same open sets 
as the standard metric. This is because N is less than or equal to a constant 
times the standard norm, and conversely the standard norm is less than or 
equal to a constant times N. The first statement can be checked directly by 
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expressing any element of R" as a linear eombination of the standard basis 
for R" and using the triangle inequality. As mentioned previously, this and 
the triangle inequality imply that N is continuous with respect to the standard 
norm. Hence the minimum of N is attained on the standard imit sphere in R", 
since the latter is compact. The standard norm times the minimum of N on 
the unit sphere is less than or equal to N on all of R", by homogeneity, which 
implies the second statement. For explicit norms like HwUp, 1 < f> < oo, the 
comparison with the standard norm can be verified directly. 

Suppose that U is a. connected open set in R", which is to say that U is not 
the union of two disjoint nonempty open sets. It is well known that U is then 
pathwise-connected, so that for every p,q & U there is a continuous mapping 
/ : [a, 6] — > f/ such that /(a) = p and f{h) = q. More precisely, one can even 
take / to be piecewisc-afRnc on [a, b]. In particular, / then has finite length. 

However, it is not clear how small the length of / can be. Of course, the 
length of / is at least the distance between p and q. If U is convex, then one 
can take / to be affine, and the length of / is equal to the distance between p 
and q. Otherwise, the length of / may have to be quite large compared to the 
distance between p and q. It is easy to give examples where this happens in 
the plane. For instance, there may be elements of U on opposite sides of the 
boundary locally. The boundary of U might also be complicated, with spirals 
or other obstacles. 

Even if the boundary of U is complicated, it may be that U behaves well in 
terms of lengths of paths. For example, if U is the region in the plane bounded 
by the von Koch snowflake, then every pair of elements of U can be connected 
by a path of length bounded by a constant multiple of the distance between 
them. The main idea is for the path to avoid the boundary as much as possible, 
without going too far away. There can also be relatively small parts of the 
boundary that only cause minor detours for paths in the domain. 

18 Lipschitz graphs 

Let k, I, n be positive integers such that k + I = n, and let us identify R" with 
R*^ X R', so that an element x of R" may be expressed as {x' , x"), where x' G R*^ 
and x" e R'. Also let A : R*^ — > R' be a continuous mapping, and consider its 

graph 

(18.1) {{x',x") en" :x" = A{x')}. 

This is a nice A:-dimensional topological submanifold of R". If /c = n — 1, 
then this hypersurface has two complementary components U+, U- consisting 

of {x',x") e R" such that x" > A{x') and x" < A{x'), respectively. If fc < n, 
then the complement of the graph in R" is connected. For any k, 

(18.2) {x',x")^{x',x" + A{x')) 

defines a homeomorphism on R" that sends the fc-plane x" = to the graph of 
A. If f{t) is a continuous path in R*^ parameterized by an interval [a,b], then 
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f{t) — {f{t), A{f{t)) is a continuous path in the graph of A. The graph of A is 
itself a curve in R" when k = 1. 

Suppose that A is Lipschitz. If / has finite length, then / does too, and 
the length of / is bounded by a constant multiple of the length of /. If the 
Lipschitz constant of A is small, then this constant multiple is close to 1. Using 
affine paths in R*^, we get that every pair of elements of the graph of A can be 
connected by a continuous path in the graph of A of finite length bounded by a 
constant multiple of the distance between them, where the constant multiple is 
close to 1 when A has small Lipschitz constant. 

A fc-dimcnsional submanifold of R" is locally the same as the graph of a 
continuously-differentiable mapping on R*^ with respect to a suitable choice of 
coordinate axes. By rotating the axes so that R*^ is parallel to the tangent plane 
of the submanifold at a particular point, the submanifold can be represented 
near the point as the graph of a function with small Lipschitz constant. The 
Lipschitz constant tends to as one approaches the point in question. Thus 
distances on submanifolds are approximately the same as the infimum of 
lengths of paths on the submanifold locally. 

19 Real analysis 

For the sake of simplicity, we have so far avoided referring to Lebesgue integrals 
and measure. However, this more sophisticated theory can be quite convenient 
in the present context. Let us mention some of the key points. 

A basic fact is that a monotone real- valued function on an open interval in the 
real line is differentiable "almost everywhere" , which is to say on the complement 
of a set of Lebesgue measure 0. Thus additional hypotheses of differentiability 
are sometimes superfluous. Unfortunately, even continuous monotone functions 
cannot necessarily be recoved from their almost everywhere derivative, as in 
the fimdamcntal theorem of calculus, without an extra condition of "absolute 
continuity" . Indeed, there are examples of nonconstant continuous monotone 
increasing functions with derivative equal to almost everywhere. 

It follows that a real- valued function of bounded variation on an interval in 
the real line is differentiable almost everywhere, since it can be expressed as 
the difference of two monotone increasing functions. In particular, a real- valued 
Lipschitz function on an interval is differentiable almost everywhere. Lipschitz 
functions are absolutely continuous, and so there is a version of the fundamental 
theorem of calculus for them. As corollaries of this fact, a Lipschitz function 
/ on an interval is constant if f'{x) = almost everywhere, / is monotone 
increasing if f'{x) > almost everywhere, and / is C-Lipschitz if < C 

almost everywhere. 

At the same time, bounded variation and Lipschitz conditions have natural 
extensions involving metric spaces, as we have seen. The composition of a path 
of finite length in a metric space with a real-valued Lipschitz function on the 
metric space is a function of bounded variation, which is Lipschitz when the 
path is. There are also a lot of real-valued Lipschitz functions on any metric 
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space. Even on R", there arc a lot of nice functions that are Lipschitz and not 
continuously differentiable, such as the distance to a point or to a set. 
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